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Abstract. With the increased use of ontologies in semantically-enabled 
applications, the issue of debugging defects in ontologies has become 
increasingly important. These defects can lead to wrong or incomplete 
results for the applications. Debugging consists of the phases of detec- 
tion and repairing. In this paper we focus on the repairing phase of a 
particular kind of defects, i.e. the missing relations in the is-a hierarchy. 
Previous work has dealt with the case of taxonomies. In this work we 
extend the scope to deal with ALC ontologies that can be represented 
using acyclic terminologies. We present algorithms and discuss a system. 

This is an extended version of |18| . 

1 Introduction 

Developing ontologies is not an easy task, and often the resulting ontologies 
are not consistent or complete. Such ontologies, although often useful, also lead 
to problems when used in semantically-enabled applications. Wrong conclusions 
may be derived or valid conclusions may be missed. Defects in ontologies can 
take different forms (e.g. [IB]). Syntactic defects are usually easy to find and to 
resolve. Defects regarding style include such things as unintended redundancy. 
More interesting and severe defects are the modeling defects which require do- 
main knowledge to detect and resolve, and semantic defects such as unsatisfiable 
concepts and inconsistent ontologies. Most work up to date has focused on de- 
bugging (i.e. detecting and repairing) the semantic defects in an ontology (e.g. 
|16I15I29T5] ). Modeling defects have mainly been discussed in |2l20ll9j for tax- 
onomies, i.e. from a knowledge representation point of view, a simple kind of 
ontologies. The focus has been on defects regarding the is-a structure of the 
taxonomies. In this paper we tackle the problem of repairing the is-a structure 
of ALC ontologies that can be represented using acyclic terminologies. 
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In addition to its importance for the correct modeling of a domain, the struc- 
tural information in ontologies is also important in semantically-enabled appli- 
cations. For instance, the is-a structure is used in ontology-based search and 
annotation. In ontology-based search, queries are refined and expanded by mov- 
ing up and down the hierarchy of concepts. Incomplete structure in ontologies 
influences the quality of the search results. As an example, suppose we want to 
find articles in the MeSH (Medical Subject Headings j25j, controlled vocabu- 
lary of the National Library of Medicine, US) Database of PubMed [2Zj using 
the term Scleral Diseases in MeSH. By default the query will follow the hierar- 
chy of MeSH and include more specific terms for searching, such as Scleritis. If 
the relation between Scleral Diseases and Scleritis is missing in MeSH, we will 
miss 738 articles in the search result, which is about 55% of the original result. 
The structural information is also important information in ontology engineering 
research. For instance, most current ontology alignment systems use structure- 
based strategies to find mappings between the terms in different ontologies (e.g. 
overview in |21] ) and the modeling defects in the structure of the ontologies have 
an important influence on the quality of the ontology alignment results. 

Debugging modeling defects in ontologies consists of two phases, detection 
and repair. There are different ways to detect missing is-a relations. One way 
is by inspection of the ontologies by domain experts. Another way is to use 
external knowledge sources. For instance, there is much work on finding rela- 
tionships between terms in the ontology learning area [3^. In this setting, new 
ontology elements are derived from text using knowledge acquisition techniques. 
Regarding the detection of is-a relations, one paradigm is based on linguistics 
using lexico-syntactic patterns. The pioneering research conducted in this line 
is in [llj . which defines a set of patterns indicating is-a relationships between 
words in the text. Another paradigm is based on machine learning and statistical 
methods. Further, guidelines based on logical patterns can be used [S]. When 
the ontology is part of a network of ontologies connected by mappings between 
the ontologies, knowledge inherent in the ontology network can be used to detect 
missing is-a relations using logical derivation f2l 20ll9) . However, although there 
are many approaches to detect missing is-a relations, these approaches, in gen- 
eral, do not detect all missing is-a relations. For instance, although the precision 
for the linguistics-based approaches is high, their recall is usually very low. 

In this paper we assume that the detection phase has been performed. We 
assume that we have obtained a set of missing is-a relations for a given on- 
tology and focus on the repairing phase. In the case where our set of missing 
is-a relations contains all missing is-a relations, the repairing phase is easy. We 
just add all missing is-a relations to the ontology and a reasoner can compute 
all logical consequences. However, when the set of missing is-a relations does 
not contain all missing is-a relations - and this is the common case - there are 
different ways to repair the ontology. The easiest way is still to just add the 
missing is-a relations to the ontology. For instance. Figure [T] shows a small part 
of a pizza ontology based on [^551, that is relevant for our discussions. Assume 
that we have detected that MyPizza C FishyMeatyPizza and MyFruttiDiMare 
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C NonVegetarianPizza are missing is-a relations. Obviously, adding MyPizza t 
FishyMeatyPizza and MyFruttiDiMare C NonVegetarianPizza to the ontology 
will repair the missing is-a structure. However, there are other more interest- 
ing possibilities. For instance, adding AnchoviesTopping C FishTopping and 
ParmaHamTopping t Meat Topping will also repair the missing is-a structure. 
Another more informative wajo to repair the missing is-a structure is to add 
AnchoviesTopping C FishTopping and ParmaHamTopping C HamTopping. Es- 
sentially, these other possibilities to repair the ontology include missing is-a rela- 
tions (e.g. AnchoviesTopping C FishTopping) that were not originally detected 
by the detection algorithm^ We also note that from a logical point of view, 
adding AnchoviesTopping C Meat Topping and ParmaHamTopping C FishTop- 
ping also repairs the missing is-a structure. However, from the point of view 
of the domain, this solution is not correct. Therefore, as it is the case for all 
approaches for debugging modeling defects, a domain expert needs to validate 
the logical solutions. 

The contributions of this paper are threefold. First, we show that the problem 
of finding possible ways to repair the missing is-a structure in an ontology in 
general can be formalized as a generalized version of the TBox abduction problem 
(Section|3l). Second, we propose an algorithm to generate different ways to repair 
the missing is-a structure in ACC ontologies that can be represented using acyclic 
terminologies (Section^). Third, we discuss a system that allows a domain expert 
to repair the missing is-a structure in Section [S] We discuss the functionality 
and user interface of the repairing system and show an example run. Further, 
we discuss related work in Section \6\ and conclude in Section [71 We continue, 
however, with some necessary preliminaries in Section [2] 

2 Preliminaries 

In this paper we deal with ontologies represented in the description logic ACC 
with acyclic terminologies (e.g. ^j). Concept descriptions are defined using con- 
structors as well as concept and role names. As constructors ACC allows concept 
conjunction (C □ D), disjunction (C U D), negation (-i C), universal quantifi- 
cation (V r.C) and existential quantification (3 r.C)H In this paper we consider 
ontologies that can be represented by a TBox that is an acyclic terminology. 
An acyclic terminology is a finite set of concept definitions (i.e. terminologi- 
cal axioms of the form C = D where C is a concept name) that neither con- 
tains multiple definitions nor cyclic definitions|f| An ABox contains assertional 

^ This is more informative in the sense that the former is derivable from the latter. 
Adding ParmaHamTopping C HamTopping, also allows to derive ParmaHamTop- 
ping CI Meat Topping as the ontology already includes HamTopping C Meat Topping. 

^ Therefore, the approach discussed in this paper can also be seen as a detection 
method that takes already found missing is-a relations as input. 

^ C and D represent concepts, and r represents a role. 

* We observe that the TBox in Figure \T\ is not an acyclic terminology as there are 
statements of the form A O C. However, it is possible to create an equivalent TBox 
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Pizza □ T 
PizzaToppiiig C T 
hasTopping □ T X T 
AnchovicsTopping □ PizzaTopping 
McatTopping □ PizzaTopping 
HaniTopping □ McatTopping 
ParmaHamTopping □ PizzaTopping 
FishTopping C PizzaTopping D -iMeatTopping 

TomatoTopping □ PizzaTopping □ —iMeatTopping □ -iFishTopping 
GarlicTopping t PizzaTopping □ -iMeatTopping □ ^FishTopping 

MyPizza ^ Pizza □ 3 hasTopping. AnchovicsTopping □ 3 hasTopping. ParmaHamTopping 

FishyMcatyPizza ^ Pizza D 3 hasTopping. FishTopping □ 3 hasTopping. McatTopping 

MyFruttiDiMare ^ Pizza □ 3 hasTopping. AnchovicsTopping 

n 3 hasTopping. GarhcTopping fl 3 hasTopping. TomatoTopping 

n V hasTopping. (AnchovicsTopping U GarhcTopping U TomatoTopping) 

VcgetarianPizza ^ Pizza □ -> 3 hasTopping. FishTopping □ -i 3 hasTopping. McatTopping 

NonVegctarianPizza ^ Pizza R — iVegctarianPizza 



Fig. 1. A pizza ontology. 



AnchovicsTopping ^ PizzaTopping □ AnchoviesT opping 
McatTopping ^ PizzaTopping □ AleatTopping 
HamTopping ^ McatTopping □ H amT opping 
ParmaHamTopping ^ PizzaTopping □ ParmaHamTopping 
FishTopping ^ PizzaTopping □ -iMeatTopping □ FishTopping 

TomatoTopping ^ PizzaTopping D -iMeatTopping H -iFishTopping □ TomatoTopping 
GarhcTopping ^ PizzaTopping D -iMeatTopping □ -iFishTopping □ GarlicTopping 
MyPizza ^ Pizza □ 3 hasTopping. AnchovicsTopping □ 3 hasTopping. ParmaHamTopping 
FishyMcatyPizza ^ Pizza D 3 hasTopping. FishTopping □ 3 hasTopping. McatTopping 
MyFruttiDiMare ^ Pizza □ 3 hasTopping. AnchovicsTopping 

n 3 hasTopping. GarhcTopping □ 3 hasTopping. TomatoTopping 
n V hasTopping. (AnchovicsTopping U GarhcTopping U TomatoTopping) 
VcgetarianPizza ^ Pizza □ -i 3 hasTopping. FishTopping fl -i 3 hasTopping. McatTopping 
NonVegctarianPizza ^ Pizza fl —iVegctarianPizza 



Fig. 2. A pizza ontology - Acyclic ACC terminology. 

knowledge, i.e. statements about the membership of individuals (interpreted as 
elements in the domain) to concepts (C(i)) as well as relations between individ- 
uals (r(i,j))|l A knowledge base contains a TBox and an ABox. A model of the 
TBox/ABox/knowledge base satisfies all axioms of the TBox/ABox/knowledge 
base. A knowledge base is consistent if it does not contain contradictions. 

An important reasoning service is the checking of (un)satisfilibility of con- 
cepts (a concept is unsatisfiable if it is necessarily interpreted as the empty set 
in all models of the TBox/ABox/knowledge base, satisfiable otherwise). A TBox 
is incoherent if it contains an unsatisfiable concept. 

Checking satisfiability of concepts in ACC can be done using a tableau-based 
algorithm (e.g. [T]). To test whether a concept C is satisfiable such an algorithm 

that is a acyclic terminology by replacing the statements of the form ACC with A 
= C r\ A where A is new atomic concept. See Figure [J] 

i and j represent individuals. In the completion graph in Section 21 statements of the 
form C(i) are written as i : C, and statements of the form r(i,j) are written as irj. 
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n-rule: if the ABox contains (Ci R C2)(x), but it does not contain both Ci(x) and C2(x), 

then these arc added to the ABox. 
U-rule: if tlie ABox contains (Ci U C2)(x), but it contains neither Ci(x) nor C2(x), 

then two ABoxes are created representing the two choices of adding Ci(x) or adding C2(x). 
V-rulc: if the ABox contains (V r.C)(x) and r(x,y), but it does not contain C(y), 

then tliis is added to the ABox. 
zl-rulc: if the ABox contains (3 r.C)(x) but there is no individual z such that r(x,z) and C(z) arc in the ABox, 

then r(x,y) and C(y) with y an individual name not occurring in the ABox, are added. 



Fig. 3. Transformation rules (e.g. [T]). 

starts with an ABox containing the statement C(x) where x is a new individual 
and it is usually assumed that C is normalized to negation normal form. It then 
applies consistency-preserving transformation rules to the ABox (Figure [3]). The 
□-, V- and 3-rules extend the ABox while the U-rule creates multiple ABoxes. The 
algorithm continues applying these transformation rules to the ABoxes until no 
more rules apply. This process is called completion and if one of the final ABoxes 
does not contain a contradiction (we say that it is open), then satisfiability 
is proven, otherwise unsatisfiability is proven. One way of implementing this 
approach is through completion graphs which are directed graphs in which every 
node represents an ABox. Application of the U-rule produces new nodes with 
one statement each, while the other rules add statements to the node on which 
the rule is applied. The ABox for a node contains all the statements of the node 
as well as the statements of the nodes on the path to the root. Satisfiability is 
proven if at least one of the ABoxes connected to a leaf node does not contain 
a contradiction, otherwise unsatisfiability is proven. 

In this paper we assume that an ontology O is represented by a knowledge 
base containing a TBox that is an acyclic terminology and an empty ABox. In 
this case reasoning can be reduced to reasoning without the TBox by unfolding 
the definitions. However, for efficiency reasons, instead of running the previously 
described satisfiability checking algorithm on an unfolded concept description, 
the unfolding is usually performed on demand within the satisifiability checking 
algorithm. It has been proven that satisfiability checking w.r.t. acyclic termi- 
nologies is PSPACE-complete in ACC [33]. 

3 An abduction problem 

In our setting, a missing is-a relation in an ontology O represented by a knowledge 
base KB, is an is-a relation between named concepts that is not derivable from 
the KB, but that is correct according to the intended domain. We assume that 
we have a set M of missing is-a relations (but not necessarily all) for O. Then, the 
is-a structure of O can be repaired by adding is-a relations (or axioms of the form 
CUD) between named concepts to O such that the missing is-a relations can 
be derived from the repaired ontology. This repair problem can be formulated 
as a generalized version of the TBox abduction problem. 
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Definition 1 Let KB be a knowledge base in C, and for 1 < i < m: Ci, Di 

are concepts that are satisfiable w.r.t. KB. such that KB U { Ci Di \ 1 < 
i < m} is coherent. A solution to the generalized TBox abduction problem for 
(KB, {(Ci, Di) I 1 < i < m}) is any finite set Sqt — {Cj ^ Hj \ j < n} of 
TBox assertions in C such that V i: KB U Sgt |= C Di . The set of all such 
solutions is denoted by Sgt (KB, {(Ci, Di )\ 1 < i < m}). 

In our setting the language C is ACC and £' only allows named concepts. We 
say that any solution in Sgt(KB, {(C^, Di)}i) is a repairing action. A repairing 
action is thus a set of is-a relations. When m — 1, this definition of the gener- 
alized TBox abduction problem coincides with the TBox abduction problem as 
formulated in [S] , which therefore deals with repairing one missing is-a relation. 
Further, we have that Sgt(KB, {{d, D,)}z) = n, Sgt(KB, {(C„ D,)})! 
This shows that solving a generalized TBox abduction problem can be done by 
solving m TBox abduction problems and then taking the intersection of the so- 
lutions. In practice, however, this leads to a number of difficulties. First, it would 
mean that a domain expert will need to choose between large sets of repairing 
actions for all the missing is-a relations at once, and this may be a very hard 
task. Further, due to the size of the solution space, even generating all solutions 
for one TBox abduction problem is, in general, infeasible. Also, many of the so- 
lutions will not be interesting for a domain expert (e.g. IIH])- For instance, there 
may be solutions containing is-a relations that do not contribute to the actual 
repairing. Some solutions may introduce unintended equivalence relations. 

A common way to limit the number of possible solutions is to introduce con- 
straints, e.g. minimality. Our proposed algorithm generates solutions to a TBox 
abduction problem that are minimal in the sense that repairing actions only 
contain necessary information for repairing the missing is-a relations. Further, 
we check the generated solutions for the introduction of incoherence. 

4 Algorithm for generating repairing actions 
4.1 Basic algorithm 

The basic algorithm for generating repairing actions for a set of given missing is-a 
relations for an ontology is shown in Figure SI In this first study we assume that 
the existing structure in the ontology is correct. Also, as stated in the definition 
in Section [31 adding the missing is-a relations to the ontology does not lead to 
incoherence. 

In step 1 a set of repairing actions is generated for each missing is-a relation 
(thereby solving a TBox abduction problem for each missing is-a relation). In 

® A solution for all missing is-a relations is also a solution for each missing is-a relation 
and therefore, in the intersection of the solutions concerning one missing is-a relation 
at the time. Further, a solution in the intersection of the sets of solutions for each 
of the missing is-a relations, allows, when added to the knowledge base, to derive all 
missing is-a relations, and is therefore, a solution for the generalized TBox abduction 
problem as well. 
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Input: The ontology O represented by knowledge base KB and a set of missing is-a relations M. 

Output: Set of repairing actions Rep{A4). 

Algorithm 

1. For every missing is-a relation Ai C Bi in M: 

1.1 G — completion graph after running a tableaux algorithm with Ai D ^Bi as input on KB; 

1.2 Leaf-ABoxes — get ABoxcs of the leaves from the completion graph G; 

1.3 For every open ABox A (E Leaf-ABoxes: 

1.3.1 Ra = 0; 

1.3.2 For every individual Xj in A: 

1.3.2.1 Pos^^ — {P \ Xj : P G -4 A P is a named concept}; 

1.3.2.2 Neg^. = {N | Xj : --N e A f\ N is & named concept}; 

1.3.2.3 Ra = Ra'J {P Q N \ P e Pos^ - A N e Neg^ . }; 

1.4 flep(Ai,Bi) = 0; ^ ' 

1.5 As long as there are different choices: 

1.5.1 Create a repairing action ra by choosing one element from each set Ra\ 

1.5.2 Rep{Ai,Bi) = Rep{Ai,Bi) U {ra}; 

1.5.3 Remove reduncancy in Rep{Ai ^ Bi)\ 

1.5.4 Remove incoherent solutions from Rep{Ai , Bi); 

2. Rep(M) = { M }; 

3. As long as there are different choices: 

3.1 Create a repairing action rp by choosing one element from each Rep(Ai^ Bi) 
and taking the union of these elements; 

3.2 Rep(M) = Rep(M) U { rp }; 

4. Remove reduncancy in R.ep{M); 

5. Remove incoherent solutions from Rep{M); 



Fig. 4. Basic algorithm for generating repairing actions using completion graph. 



step 1 . 1 we run the satisfiability checking algorithm with unfolding on demand as 
described in Section [51 on KB with input Ai □ ^Bi, and we collect the ABoxes of 
the leaves in step 1.2. As C is a missing is-a relation, it cannot be derived 
from KB and thus the completion graph will have open leaf ABoxes. We then 
generate different ways to close these ABoxes in step 1.3. For each individual x 
in an open leaf ABox we collect the concepts in the statements of the form x:A 
in PoSx and the concepts in the statements of the form x:^B in Negx where A 
and B are named concepts. The ABox can be closed \i A \— B \s added to the 
ontology for any A G PoSx and B £ Negx- Indeed, with this extra information 
x:A could be unfolded and x:B would be added to the ABox, and this gives a 
contradiction with x:^B which was already in the ABox. A repairing action for 
the missing is-a relation is then a set of A Q B that closes each open leaf ABox. 
In step 1.5 we generate such sets by selecting one such axiom per open leaf ABox 
and remove redundancy based on the sub-set relation. If a repairing action is a 
super-set of another repairing action, it is removed. Further, we remove solutions 
that introduce incoherence. 

In step 2 the repairing actions set for all missing is-a relations is initialized 
with the set of missing is-a relations. Therefore, there will always be at least 
one repairing action. In step 3 additional repairing actions are generated by 
combining repairing actions for the individual missing is-a relations. As repairing 
actions are sets, there are no duplicates in a repairing action. In step 4 we remove 
redundancy based on the sub-set relation. In step 5 we remove solutions that 
introduce incoherence. We note that there always will be at least one solution 
that does not introduce incoherence (i.e. Mor a sub-set of M). 
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{MyPizza C FishyMeatyPizza} 
{Pizza C FishyMeatyPizza}* 

{AnclioviesTopping CI Fisli Topping, AncliovicsTopping □ McatTopping}* 
{PizzaTopping C Fisli Topping, AnchovicsTopping □ McatTopping}* 
{ParinaHaniTopping □ Fisli Topping, AnchovicsTopping □ McatTopping} 
{AnchovicsTopping □ FishTopping, PizzaTopping □ McatTopping}* 
{PizzaTopping [Z FishTopping, PizzaTopping [Z McatTopping}* 
{ParmaHaniTopping C FishTopping, PizzaTopping C McatTopping}* 
{AnchovicsTopping [I FishTopping, ParmaHaniTopping C McatTopping} 
{PizzaTopping C Fisli Topping, ParmaHaniTopping (Z McatTopping}* 
{ParmaHamTopping C FishTopping, ParmaHamTopping □ McatTopping}* 



Fig. 5. Non-redundant repairing actions for MyPizza CI FishyMeatyPizza. 

As an example, consider the acyclic terminology in Figure [5] equivalent to 
the ontology in Figure [T] and M = {MyPizza C FishyMeatyPizza, MyFruttiDi- 
Mare C NonVegetarianPizza}. For the missing is-a relation MyPizza □ Fishy- 
MeatyPizza the completion graph obtained after running the satisfiability check 
on My Pizzan-iFishy Meaty Pizza is shown in Figure [T^ The completion graph 
contains 17 nodes of which 11 are leaf nodes. Of these leaf nodes 6 are closed 
and the repairing actions will be based on the 5 open leaf nodes. The compu- 
tation of the PoSxj , Negxj and for the leaf ABoxes is given in Figure [M) 
Closing all open leaf ABoxes will lead to 11 non- redundant repairing actions (see 
Figure [5]) of which 8 lead to incoherence (marked by * in Figure [S]) . The remain- 
ing repairing actions are {MyPizza t FishyMeatyPizza}, {AnchovicsTopping C 
FishTopping, ParmaHamTopping C McatTopping } and {ParmaHamTopping C 
FishTopping, AnchoviesTopping C McatTopping}. Observe that in this example 
we removed the repairing actions containing a concept of the form A. This is 
because whenever x:A occurs in an ABox, then also x:A is in that ABox, as x:A 
only can appear by unfolding A in x:A. Therefore, whenever A is a choice for 
repairing, then also ^ is a choice for repairing, and we can ignore the A choice, 
which does not relate to a concept in the original ontology. 

In a similar way, for missing is-a relation MyFruttiDiMare C NonVegetar- 
ianPizza, we find the following non-redundant coherence-preserving repairing 
actions: {MyFruttiDiMare C NonVegetarianPizza}, {AnchovicsTopping C Fish- 
Topping} and {AnchovicsTopping t McatTopping}. After combining the repair- 
ing actions for the individual missing is-a relations, and removing redundancy 
and incoherence-introducing repairing actions, we obtain the following 5 solu- 
tions for the repairing of the missing is-a structure: {MyPizza □ FishyMeatyP- 
izza, MyFruttiDiMare C NonVegetarianPizza}, {AnchovicsTopping C FishTop- 
ping, ParmaHamTopping C McatTopping}, {ParmaHamTopping □ FishTop- 
ping, AnchoviesTopping C McatTopping}, {MyPizza □ FishyMeatyPizza, An- 
choviesTopping C FishTopping}, and {MyPizza □ FishyMeatyPizza, AnchoviesTop- 
ping C McatTopping}. 

A repairing action is a set of statements of the form A \Z B with A and 
B named concepts. We note that for acyclic terminologies, after adding such a 
statement, the resulting TBox is not an acyclic terminology anymore. If this is 
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needed, then instead of adding A \Z B the following should be done. If there 
is no definition for A yet in the TBox, then add A ^ B Fl A with A a new 
atomic concept. If there is already a definition for A in the TBox, say A = C 
then change this definition to A = B Fl C. For instance, to add the repairing ac- 
tion {AnchoviesTopping t FishTopping, ParmaHamTopping t MeatTopping} 
to the acyclic terminology in Figure [21 we have to change two definitions. An- 
choviesTopping = PizzaTopping n AnchoviesTopping becomes AnchoviesTop- 
ping = PizzaTopping n AnchoviesTopping □ FishTopping, and ParmaHam- 
Topping PizzaTopping □ ParmaHamTopping becomes ParmaHamTopping 
= PizzaTopping n ParmaHamTopping □ MeatTopping. 

4.2 Solution properties 

We prove that the algorithm is correct. In the following A and B are named 
concepts. 

Lemma 1 // an ABox in the algorithm in Figure^ contains x:A and x.—iB for 
some individual x, then adding A\— B to the ontology will close this ABox. 

Proof. Adding A C _B to an acyclic terminology can be done in the following 
way. If there is no definition for A yet in the TBox, then add A = B U A with 
A a new atomic concept. If there is already a definition for A in the TBox, say 
A = C then change this definition to A = _B □ C. In both cases it will allow 
an unfolding step in the ABox for x:A such that x.B Fl A or x:B □ C is added 
to the ABox. A further application of the Fl-rule will then also add x:B to the 
ABox. This leads to a contradiction with x:^B and closes the ABox. 

Lemma 2 Let G be the completion graph after running the tableaux algorithm 
with Ai n -iBi as input on KB. Let Ge be the completion graph when running the 
tableaux algorithm with Ai □ -iBi as input on KB^, the knowledge base for the 
extended ontology where (i) a statement A = B r\ A is added, or (ii) a statement 
A = C is changed to A = B n C . Then for every open leaf ABox Ae in Ge, 
there is a corresponding open leaf ABox A in G such that A C Ae- 

Proof. We observe that the statements that are valid in the original ontology are 
still valid in the extended ontology. Also, all transformation rules that could be 
applied in the tableaux algorithm on the original ontology can also be applied 
on the extended ontology. Further, additionally, for the extended ontology, there 
may be an unfolding based on the extension (i.e. when x:A appears in an ABox, 
then by unfolding x:B (and in case (i) also x:A) should be added), as well as ad- 
ditional applications of transformation rules based on x.B and its consequences. 
Therefore, assume that Ae is an open leaf ABox in Gg. Then Ae will contain 
statements that are a result of transformations that could have been peformed 
in the original ontology as well as statements that can only be derived based on 
x:B. 
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A node n tagged with 6 changes its tag to cr if: 

1. node n has an incoming unfold edge from a cr-tagged node p 

2. node n has an incoming D edge from a CT-taggcd node p 

3. node n has an incoming 3^ edge from a cj-taggcd node p 

4. node n has an incoming zlc edge from a CT-tagged node p 

5. node n has an incoming edge from a fj-taggcd node pi 

and an incoming Vc edge from a cr-taggcd node Pc 

6. node n has an incoming L) edge from a CT-tagged node p 

and all other nodes with an incoming U edge from p arc tagged with 9 

7. node n has an incoming U edge from a cr-tagged node p 

and there are no other nodes with an incoming U edge from p 



Fig. 6. Tagging rules. 

In order to identify which statements in Ae belong to which of these two cat- 
egories, we built a dependency graph DG = (V, E) where the vertices represent 
statements from Ae, and there is an edge from node rii related to statement 
to node rij related to statement Tj if tj could be a direct result of applying some 
transformation rule or unfolding on r^. We label the edges with U, 3i (from 
a node related to x:3r.D to a node related to xry), 3c (from a node related to 
x:3r.D to a node related to y:D), Vi (from a node related to xry to a node related 
to y:C), Vc (from a node related to x:Vr.C to a node related to y:C) and unfold 
depending on which transformation rule was applied. 

Further, we tag the nodes in the dependency graph using a and 6. When the 
tagging is finished, the statements related to the nodes tagged with a are a result 
of transformations that could have been performed in the original ontology. The 
algorithm for tagging nodes is given in Figure [T] Initially, the node related to 
x:Ai n -^Bi is tagged with cr. Further, for case (i) all nodes related to z:A and z:B 
n A for some individual z are tagged with 9 and this tag can never be changecQ; 
All other nodes are initially tagged with 9 and collected in the set 0-nodes. We 
then traverse the dependency graph and change 9 tags into a tags according 
to a number of rules. Essentially, the tagging simulates the construction of a 
completion graph (step 4) by first using n, V, 3 and unfolding rules until no 
more such rules are applicable (essentially an ABox in the completion graph). 
This is achieved in step 4.1. by applying the tagging rules 1-5 in Figure|6l For case 
(ii) we need to deal with a special case when unfolding x:A (step 4.1.2). When no 
more such rules can be applied, but a U-rule could be applied, a choice is made 
and the procedure is repeated (essentially a child ABox in the completion graph 
is created and then if needed, further descendant ABoxes). This is achieved by 
using tagging rules 6 and 7 in FigurelHlin step 4.4. Step 4 finishes when all nodes 
have a tags or no more tagging rules can be applied to nodes with 9 tags. 

We now show that the set of statements related to a-tagged nodes is an 
open leaf ABox ^ in G (with possible renaming of individuals) . It is clear that 
A C Ae- Further, A contains statements that can be derived by running the 
tableatrx algorithm on the original ontology with Ai n -iBi as input. (Without 



A does not occur in the original ontology and thus these statements are not included 
in any ABox related to the original ontology. 
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Input: The initial dependency graph DG. 
Output: Tagged dependency graph. 
Algorithm 

1. Tag the node related to xiA^ □ —^Bi with a. 

l(i). Only for case (i): Tag all nodes related to z:A and z:B fl A for some indivdual z with 0: 

2. Tag all remaining nodes with and add them to the set 0-nodes; 

3. Visited :^ 0; 

4. While 6-nodes ^ and ^-nodes ^ Visited, do: 

4.1 While ^-nodes ^ and ^-nodes ^ Visited, do: 

4.1.1 Select a node n G 0-nodes \ Visited and add n to Visited; 

4.1.2 If n is related to x:BnC for some x 

and rule 1 is applicable for unfolding a cr-tagged node related to x:A (only case (ii) 
then 

4.1.2.1 If there is no node related to x:C 
then 

4.1.2.1.1 Add a new node related to x:C and connect it with outgoing H-edges to 
all nodes that are connected with incoming Fl-edges to the node related to x:BnC, 
except the node x:B; 

4.1.2.2 Tag the node related to x:C with a; 
else 

4.1.2.1' If any of the tagging rules 1-5 in Figure[7]is applicable, 
then 

4.1.2.1'.l Change the tag of n to (t; 
4. 1.2.1'. 2 Remove n from ^-nodes; 
4. 1.2.1'. 3 Visited :^ 0; 

4.2 Visited :^ 0; 

4.3 Still-to-visit :— true; 

4.4 While ^-nodes ^ and ^-nodcs ^ Visited and Still-to-visit — true, do: 

4.4.1 Select a node n G 5-nodes \ Visited and add n to Visited; 

4.4.2 If tagging rule 6 or 7 in Figure[7]is applicable, 
then 

4.4.2.1 Change the tag of n to cr; 

4.4.2.2 Remove n from ^-nodes; 

4.4.2.3 Visited :^ 0; 
4.4.2.3 Still-to-visit :— false; 

5. Return the tagged graph; 



Fig. 7. Algorithm for tagging the dependency graph. 



loss of generality, individuals may be renamed.) The dependency graph shows 
which transformation rules and unfoldings can be performed. A is also a leaf 
ABox in the completion graph obtained by running the tableaux algorithm on 
the original ontology with Ai D as input. If A were not a leaf ABox in that 
completion graph, then there would be other statements that could be added 
through the application of the transformation rules or unfolding. However, in 
that case, these transformation rules or unfoldings could also be applied to Ae 
and these statements could be added to Ae^ which would contradict the fact that 
Ae is a leaf ABox. Finally, as v4 C and as Ae does not contain a contradiction 
(as it is an open leaf ABox in Ge), also A does not contain a contradiction. 



Consider the example ontology in Figure 8(a) and assume we run the tableaux 
algorithm with input x:A n -iF. The completion graph is given in Figure 9(a) 
Figure |9(b)| shows a completion graph for running the tableaux algorithm with 
input A n -iF on the ontology that extends the original ontology with the axiom 
A = B n A (Figure |8(b)[ ). The completion graph Ge contains two open leaf 



ABoxes, ABox 1.2.1 and ABox 1.2.2. We observe that ABox 1.2.1 in Figure 9(b) 
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B = C n (D U E) 

D = F 

F = G n H 



(a) Original ontol- 
ogy- 



B = 


C n (D U E) 


D = 


F 


F = 


G n H 


A = 


BnA 



(b) Extended ontol- 
ogy- 



Fig. 8. Ontology - original and extended. 



ABox 1; 

(1) X 

(2) x: 

(3) ,T 

(4) X 



An ^ F 
A (n-rule (1)) 
T F (n-nile (1)) 

^ G U ^ H (unfold F and normalize) 



ABox 1.1: 




(1-4) 




(5) ,r; ^ G ( 


U-rule (4)) 



ABox 1.2: 

0-4) 

(6) x: ^ H (U-riile (4)1 



(11) 



(1)* 
[a) n 

(3) B n A (un 

(4) B^(n-rale (3)J 

(5) A (n-rule (3)) 

(6) M C n (D U E) (uiifni.l 
[Tl 1, C (n-ruie 10)) 

IS) D u E |n-(uie (Bl) 
13) x: -.F (n-rule HI] 
HO) t: G U - H liiii ' ■ ■ 




(a) Completion graph after running the (b) Completion graph after running the 
tableaux algorithm with input A fl -iF on tableaux algorithm with input A H on 



the original ontology in Figure 8(a 



the extended ontology in Figure 8(b) 



Fig. 9. Completion graphs 



contains all literals from ABox 1.1 in Figure [9(a)] a nd ABox 1.2.2 in Figure [9(b)] 



contains all literals from ABox 1.2 in Figure 9(a) 



Figure 10(a) shows an initial dependency graph for ABox 1.2.1. In this exam- 
ple all but x:A n-iF can be the direct result of the application of a transformation 
rule or unfolding of only one other statement. In general this is not case, though. 
Initially, the node related to (1) is tagged as a. All other nodes are tagged with 
6, and the tag of the nodes related to (3) and (5) cannot be changed. In the first 
iteration the tags of the nodes related to (2), (9) and (10) will be changed to 
a using the tagging rules 1 and 2. Further, the tag for the node related to (14) 
is changed to a using tagging rule 7. Then, there will be no more changes in 
the tagging and the statements related to the nodes with a tags in Figure [10(b)[ 



constitute the statements in ABox 1.1. in Figure 9(a) 



The previous lemma also holds when adding sets of axioms of the form (i) A 
= B n A or changing a sets of axioms (ii) of the form A = C to axioms of the 
form A = B nC. 



Lemma 3 Let G be the completion graph after running the tableaux algorithm 
with Ai n -iBi as input on KB. Let Ge be the completion graph when running the 
tableaux algorithm with Ai n -^Bi as input on KB^, the knowledge base for the 
extended ontology where (i) statements of the form A ^ B n A are added, or 
(ii) statements of the form A = C are changed to A ^ B Fl C . Then for every 
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(a) Initial dependency graph for ABox (b) Tagged dependency graph for ABox 



1.2.1. in Figure 9(b) 



1.2.1. in Figure 9(b) 



Fig. 10. Dependency graphs. 



open leaf ABox Ae in Ge, there is a corresponding open leaf ABox A in G such 
that A (= Ae- 

Proof. Let — {di, ...,6'„} be the changes to the original ontology, i.e. 9j 
represents either an addition of an axiom of the form A ^ B Fl A or a change of 
an axiom of the form A = C to an axiom of the form A = B U C. 

For any possible order 6''^, . . . , 6*^ of the elements in 0, let 0^ = {0'^, . . .,^^1 

C 6* be the set of the first k changes according to the chosen order. Let KBf be 
the knowledge base for the ontology that is constructed by applying the changes 
in 0'f. to the original ontology. We note that for k = n there is only one possible 
set of changes, i.e. 0'^ — and KBf " — KBg. 

According to the previous lemma, we then know that for any open leaf ABox 
in the completion graph obtained by running the tableaux algorithm with Ai □ 

-^Bi as input on KBf ''^^ with k < n, there is an open leaf ABox in the completion 
graph obtained by running the tableaux algorithm with Ai n -^Bi as input on 
KBf , such that the latter is contained in the former. 

Therefore, for every open leaf ABox Ae in Ge, there is a chain of open leaf 
ABoxes An-i, • • •, Ai, A, such that Ak is an open leaf ABox in the completion 
graph obtained by running the tableaux algorithm with Ai n -^Bi as input on 

KBf*", A is an open leaf ABox in G, and A C Ai, Ai C A2, . ■ ■ Ai-i ^ Ae- 

Lemma 4 Any element of Rep{Ai, Bi) repairs the missing is-a relation Ai C 

Proof Let {Pi C A^i, . . ., Pm t iV,„} G Rep{Ai,Bi)- Let KB be the knowledge 
base of the original ontology. Then, we want to prove that Ai □ Bi can be derived 
from the knowledge base of the ontology extended with {Pi □ A^i, . . ., Pm E 

Nm}- 
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Adding {Pi C A^i, . . ., t Nm} to an ontology means adding statements 
of the form Pj = Nj n Pj with Pj a new atomic concept or changing statements 
of the form Pj = C to Pj = Nj n C. Assume G and Ge are the completion 
graphs obtained by running the tableaux algorithm with Ai n -iBi as input on 
the knowledge bases of the original (with added missing is-a relations except 
A-iQBi) and the extended ontology respectively. 

We now prove that every leaf ABox in Ge is closed. Let us assume that there 
exists an open ABox Ae in Ge . According to the previous lemma, we know then 
that for Ae there is a corresponding leaf ABox A in G such that AQ Ae- Further, 
there is at least one individual x and named concepts P and N such that x:P 
and x'.-i N are statements in the ABox collects all possible is-a relations 

between such pairs P and N. In the algorithm one such pair is chosen to be 
added to the ontology. Therefore, there is an individual x and named concepts 
P and N such that x:P and x:^ N are statements in Ae, and such that P C 
N was added to the ontology. By lemma [T]^e is then closed which contradicts 
with our assumption. This shows that all leaf ABoxes are closed and thus Ai C 
Bi can be derived from the knowledge base of the ontology extended with {Pi 

Theorem 1 Any element of Rep{M) repairs the missing is-a relations in M. 

Proof. The previous lemma guarantees that adding any element from Rep{Ai, Bi) 
to KB allows us to derive Ai C Bi. Therefore, adding one element from each 
Rep{Ai,Bi) to KB guarantees that each missing is-a relation Ai C Bi is deriv- 
able. 

The proposed algorithm returns minimal solutions for every missing is-relation. 
In our setting a solution is minimal if it contains only necessary information for 
repairing a missing is-a relation. 

Definition 2 Let KB be a knowledge base representing ontology O and A 'i- B 
a missing is-a relation. A repairing action {Ci tZ Di, . . . , C„ C D,i\ is said to 
be minimal if it holds that KByj{Ci C Di, . . . , C„ t ^ A □ P and there is 
no {El t Fi,...,Em t Frn} C {Ci □ Pi, . . . , C„ □ P„} such that KB U {Pi 
t Pi,...,P„ ± Frn} ^AtB. 

Theorem 2 The basic algorithm for generating repairing actions for missing 
is-a relations in Figure [7] produces minimal repairing actions according to the 
Definition\^for each missing is-a relation in the set M. 

Proof. Trivial. This kind of redundancy is removed in step 1.5.3 and step 4. 

* If the ABox is open then there must be at least one such situation. For instance, we 
know that x:Ai and x^.-^ Bi are in the ABox A. 
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4.3 Optimization 

The basic algorithm may produce many redmidant solutions already in step 1. 
For instance, if the root node in the completion graph contains x:A and x:-iB 
with A and B named concepts, then A Q B appears in every open leaf ABox 
as a possible way of closing that ABox. To deal with this issue we modify the 
algorithm such that at each node with an open ABox it generates PoSxj and 
Negxj ■ However, PoSxj and Negxj only contain the concepts that are not already 
in a PoSxj and Negx^ related to an ancestor node. For instance, if A G PoSx is 
related to the root node, then A will not appear in any PoSx related to another 
node. For each ABox we then generate a set i?^ = { P — ^ } where P G 
PoSx for the current node or any ancestor node, N G Negx for the current node 
or any ancestor node, and at least one of P and N occurs in the current node. 
This allows us to reduce the redundancy in step 1. An open leaf ABox can now 
be closed by using an element from Rj^ from the leaf node or from any ancestor 
node. When generating repairing actions in step 1.5 we then make sure that 
when an element related to a non-leaf node is chosen, that no additional element 
from any descendant of that node is selected. For instance, if any element from 
the root's Ra is chosen, then no other elements should be chosen. 

As an example, let us reconsider the computation of the repairing actions 
related to MyPizza C Fishy MeatyPizza for the acyclic terminology in Figure [2j 
In the optimized version of the algorithm, we compute Px , Nx and i?^ for every 
open ABox (see Figure [T5|). The root ABox (ABox 1 in Figure [T5)) has statements 
of the forms Xj:A and Xj'.-iA and we thus create Pos^; — {MyPizza, Pizza}, 
Negx = {FishyMeatyPizza}, PoSj, = {AnchoviesTopping, AnchoviesTopping, 
PizzaTopping}, Negj, = 0, PoSz = {ParmaHamTopping, ParmaHamTopping, 
PizzaTopping}, Negj = 0. This leads to Rm — {MyPizza C FishyMeatyPizza, 
Pizza C FishyMeatyPizza} for ABox 1. We also know now that any element 
of the Rj\i will close all leaf nodes. For Abox 1.2 the new statements of the 
form Xj'.A and Xj'.-iA (i.e. not occurring in an ancestor) are y:-iFishTopping 
and z:-iFishTopping. Therefore, we create Pos^ — 0, Neg^ = 0, PoSj^ = 0, Ncg^ 
= {FishTopping}, Pos^ = 0, Neg^ = {FishTopping} for Abox 1.2. Rai.2 for 
Abox 1.2 contains the new ways to close this ABox (i.e. ways not occurring in 
ancestor nodes) and contains AnchoviesTopping □ FishTopping, PizzaTopping 
C FishTopping, and ParmaHamTopping □ FishTopping. We now know that any 
of these will close all leaf ABoxes of Abox 1.2. After all Ra are computed for 
all open ABoxes, a leaf node can be closed using an element from its Ra or any 
Ra related to an ancestor node. For instance, ABox 1.2.2.2 can be closed using 
any element from -R^i.2.2.2, -Ryti.2.2, Rai.2 and Rai- When creating repairing 
actions we then make sure that when an element related to a non-leaf node is 
chosen, that no additional element from any descendant of that node is selected 
to close any leaf ABoxes that are descendants of that non-leaf node. For instance, 
if AnchoviesTopping □ FishTopping related to ABox 1.2 is chosen, then no 
additional element is chosen to close any leaf ABoxes that are descendants of 
Abox 1.2. 
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4.4 Extension 

The algorithm can be extended to generate additional repairing actions for every 
individual missing is-a relation. In step 1.5 if A C i? is used as one of the is-a 
relations in a repairing action then also 5* C T where S" is a super-concept of A 
and T is a sub-concept of B could be used. Therefore, the extended algorithm 
generates two sets of concepts for every is-a relation A Q B in a repairing action, 
Source set containing named super-concepts of A and Target set containing 
named sub-concepts of B. Further, to not introduce non-validated equivalence 
relations where in the original ontology there are only is-a relations, we remove 
the super-concepts of B from Source, and the sub-concepts of A from Target. 

Alternative repairing actions for a repairing action {Ai C Bi, . . ., An Q 
Bn} are then repairing actions {Si t Ti, . . ., S'„ C r„} such that {Si,Ti) G 
Source{Ai, Bi) x Target{Ai, Bi). This extension allows the algorithm to produce 
more informative repairing actions. 

Next, we prove the correctness of the proposed extension. 

Theorem 3 If a missing is-a relation A B is repaired by a repairing action 
{Ci C -Di, . . . , C„ C Dn} then A \Z B will also be repaired by {Si C T^jVi : 
l..n : Si G Source{Ci, Di) /\Ti ^ Target{Ci, Di)} . 

Proof. Let KB be the knowledge base of the original ontology. As {Ci C Di, 
■ ■ ■, Cn E Dn} is a repairing action for A C _B, we know that KB U {Ci C Di, 

. . ., Cn tDn}^A tB. 

To prove that A t i? is repaired by {Si t Ti\ii : l..n : Si G Source{Ci, Di) A 
Ti G Target{Ci, Di)}, we need to show that A \Z B can be derived from 
the knowledge base of the extended ontoloy KB U {Si C Ti\Vi : l..n : Si G 
Source{Ci, Di) A G Target{Ci, Di)}. 

As Si G Source{Ci,Di), we know that KB \= d Si. Further as Ti G 
Target{C^,Di), we know that KB ^ T, t A- Therefore, KB U {Si QT,\yi : l..n : 
S^ G Source{Ci, Di)ATi G Target{C^, D,)} ^ t S^ A S^ tTi AT, t Di, and 
therefore, KB U {S^ t Ti\\/i : l..n : 5, G Source{Q, D,) AT, G Target{C,, D,)} 

^at A. 

This shows that KB U {S, ± T,|Vi : l..n : S, G Source{C,, D,) A Ti G 
Target{Ci, Di)} entails all statements that KB U {Ci C Di, Cn Z Dn} 
entails and thus also A \— B. 

As an example, consider the repairing action {AnchoviesTopping C Fish- 
Topping, ParmaHamTopping t Meat Topping} for the missing is-a relations M 
= {MyPizza C FishyMeatyPizza, MyFruttiDiMare C NonVegetarianPizza} for 
the ontology in Figure [21 as computed using the basic algorithm. The Source set 
for AnchoviesTopping C FishTopping contains all named super-concepts of An- 
choviesTopping that are not super-concepts of FishTopping, i.e. {AnchoviesTopping, 
PizzaTopping} \ {FishTopping, PizzaTopping} — {AnchoviesTopping}. The Tar- 
get set for AnchoviesTopping C FishTopping contains all named sub-concepts of 
FishTopping that are not sub-concepts of AnchoviesTopping, i.e. {FishTopping} 
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Fig. 11. Screenshot - Validating is-a relations in a repairing action. 



\ {AnchoviesTopping} = {FishTopping}. For ParmaHamTopping □ Meat Top- 
ping the Source set is {ParmaHaniTopping, PizzaTopping} \ {Meat Topping, 
PizzaTopping} = {ParmaHamTopping}, and the Target set is {Meat Topping, 
HamTopping} \ {ParmaHamTopping} = {Meat Topping, HamTopping}. In this 
small example, using Source and Target sets, we would obtain one additional re- 
pairing action {AnchoviesTopping [I FishTopping, ParmaHamTopping C Ham- 
Topping}. 

5 Implementation 

We have implemented a system that supports the user to repair missing is-a re- 
lations. In our system the user loads the ontology and the missing is-a relations 
from the 'File' menu. The missing is-a relations are then shown in a drop-down 
list (e.g. MyPizza t FishyMeatyPizz^^ in the second drop-down list in Figure 
[TT|) . The user then chooses a missing is-a relation to repair^ At any time the 
user can switch between different missing is-a relations. Once a missing is-a rela- 
tion is chosen for repairing the user generates repairing actions for it by clicking 

® In the system C C D is shown as C — s- D. 
As we repair one is-a relation at a time, there may be some redundancy in the 
solutions. 
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Fig. 12. Screenshot - Repairing using Source and Target sets. 



the 'Generate Repairing Actions' button. This covers step 1 in Figure[l]and was 
implemented in Java using PeUet (version 2.3.0) [ST. The satisfiabiUty checker 
in Pehet was modified in order to extract full completion graphs. Furthermore, 
to increase performance and account for higher level of non-determinism, on- 
tologies are first passed through Pellint before running the algorithm. The 
computed repairing actions are shown in the drop-down list under the button. 
Each repairing action consists of one or more is-a relations. In Figure [11] the 
user has chosen to repair MyPizza □ FishyMeatyPizza and the system has gen- 
erated three repairing actions that do not introduce incoherence ({MyPizza t 
FishyMeatyPizza}, {AnchoviesTopping C FishTopping, ParmaHamTopping C 
MeatTopping}, and {AnchoviesTopping C MeatTopping, ParmaHamTopping C 
FishTopping}). To repair this missing is-a relation the user has to succesfuUy 
deal with at least one of the repairing actions, i.e. add all is-a relations (or more 
informative is-a relations from Source and Target) in at least one of the repairing 
actions to the ontology. In Figure [11] the user has chosen the second repairing 
action. When repairing actions are added to the ontology, they will make the 
missing is-a relation derivable. However, it is not guaranteed that all the is- 
a relations in the repairing actions are also valid with respect to the domain. 
Therefore, a domain expert needs to validate the is-a relations in the repairing 
actions. When the user clicks the 'Validate is-a Relations in Repairing Action' 
button, a pop- up window (Figure [TT]) appears where the user can mark all the 
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is-a relations that are correct with respect to the domain model. The repairing 
actions for all missing is-a relations and the ontology are updated according to 
the results of the validation. If an is-a relation is validated as incorrect according 
to the domain, all repairing actions that contain this incorrect is-a relation, for 
this and for all other missing is-a relations, are removed from the lists of the 
repairing actions. When an is-a relation is validated as correct it is added to the 
ontology and it is marked as correct in all repairing actions for all missing is-a 
relations. When all is-a relations in the current repairing action are validated as 
correct, they are shown in the last drop-down list (Figure [12]) . Now the user can 
repair them one by one. 

For each is-a relation within the repairing action the Source and Target sets 
are generated and displayed on the left and the right hand sides, respectively, 
within the panel under the drop-down lists (Figure [T^ . Both panels have zoom 
control and can be opened in a separate window. The concepts in the is-a relation 
under consideration are highlighted in red, existing asserted and inferred is-a 
relations are shown in grey, not yet repaired missing is-a relations in blue and 
is-a relations that were previously added for repairing missing is-a relations in 
black. In order to repair the is-a relation the user has to choose one concept 
from each of the sets and click the 'Repair' button. In Figure [T^ the user has 
chosen to repair ParmaHamTopping C Meat Topping with ParmaHamTopping 
t HamTopping. Upon repair the ontology is updated, i.e. the chosen is-a relation 
(ParmaHamTopping t HamTopping) is added to the ontology. A red label next 
to the drop-down list shows the status (Repaired or Not Repaired) of the selected 
is-a relation. When all is-a relations within a repairing action are repaired the 
missing is-a relation is marked as repaired ('This relation is already repaired' 
label in Figure [T^. The other repairing actions are still available for review by the 
user. These may give information about other possible missing is-a relations. The 
user can also revoke repairing actions (through the 'Revoke Repairing Actions' 
button). If the user revokes the repairing action, the missing is-a relation may 
become not repaired again and the is-a relations within the repairing action are 
marked as not repaired. All changes in the ontology are revoked and the user 
can start repairing this missing is-a relation again in the way just described. 

Example run As an example run, consider the ontology in Figure[T]and missing 
is-a relations MyPizza C FishyMeatyPizza and MyFruttiDiMare □ NonVegetar- 
ianPizza. After loading the ontology and the missing is-a relations, we can choose 
a missing is-a relation to repair. Assume we choose MyFruttiDiMare □ NonVeg- 
etarianPizza and click the 'Generate Repairing Actions' button. The system will 
show three repairing actions in the drop-down list: {MyFruttiDiMare C NonVeg- 
etarianPizza}, {AnchoviesTopping C FishTopping}, and {AnchoviesTopping t 
Meat Topping}. We can choose to deal with AnchoviesTopping C Meat Topping 
and validate this to be incorrect with respect to the domain. In this case all 
repairing actions containing this is-a relation will be removed. We could then 
choose AnchoviesTopping C FishTopping and validate it to be correct. All is-a 
relations in this repairing action (i.e. AnchoviesTopping □ FishTopping) are val- 
idated to be correct and thus we can continue with the repair of AnchoviesTop- 
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ping □ FishTopping. In this small example the Source set only contains An- 
choviesTopping and the Target set only contains FishTopping. Therefore, we 
click on these concepts and the 'Repair' button. AnchoviesTopping C FishTop- 
ping will be marked as repaired and also the missing is-a relation MyFruttiDi- 
Mare □ NonVegetarianPizza will be marked as repaired. 

We can then start repairing MyPizza C FishyMeatyPizza. The system would 
have generated as repairing actions that do not introduce incoherence {MyPizza 
C FishyMeatyPizza}, {AnchoviesTopping □ FishTopping, ParmaHamTopping 
C MeatTopping} and {AnchoviesTopping □ MeatTopping, ParmaHamTopping 
t FishTopping}. However, as we earlier already validated AnchoviesTopping t 
MeatTopping to be incorrect with respect to the domain, the third repairing 
action has already been removed. When we choose the second repairing ac- 
tion, AnchoviesTopping C FishTopping is already marked as correct (because 
of earlier validation) and only ParmaHamTopping C MeatTopping needs to be 
validated. We validate this as correct and then choose to repair it. The Source 
set in this small example contains ParmaHamTopping and the Target set con- 
tains HamTopping and MeatTopping. Although we can add ParmaHamTopping 
C MeatTopping, it is more informative (and correct with respect to the domain) 
to add ParmaHamTopping □ HamTopping. We therefore choose the latter. All 
is-a relations in this repairing action are then repaired and thus also MyPizza C 
FishyMeatyPizza. 

6 Related work 

Debugging ontologies Detecting missing is-a relations can be done in a num- 
ber of ways (see Section [T|). There is, however, not much work on the repairing 
of missing is-a structure. In |20) we addressed this in the setting of taxonomies. 

There is more work on the debugging of semantic defects. Most of it aims at 
identifying and removing logical contradictions from an ontology. Standard rea- 
soners are used to identify the existence of a contradiction, and provide support 
for resolving and eliminating it [9] . In [29] minimal sets of axioms are identified 
which need to be removed to render an ontology coherent. In |16ll5j strategies 
are described for repairing unsatisfiable concepts detected by reasoners, explana- 
tion of errors, ranking erroneous axioms, and generating repair plans. In |10j the 
focus is on maintaining the consistency as the ontology evolves through a formal- 
ization of the semantics of change for ontologies. In i24j and ,13; the setting is 
extended to repairing ontologies connected by mappings. In this case, semantic 
defects may be introduced by integrating ontologies. Both works assume that 
ontologies are more reliable than the mappings and try to remove some of the 
mappings to restore consistency. The solutions are often based on the compu- 
tation of minimal unsatisfiability-preserving sets or minimal conflict sets. The 
work in [28] further characterizes the problem as mapping revision. Using be- 
lief revision theory, the authors give an analysis for the logical properties of the 
revision algorithms. Another approach for debugging mappings is proposed in 
[52] where the authors focus on the detection of certain kinds of defects and 
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redundancy. The approach in |14j deals with the inconsistencies introduced by 
the integration of ontologies, and unintended entailments validated by the user. 

Work that deals with both modeling and semantic defects includes [5] where 
the authors propose an approach for detecting modeling and semantic defects 
within an ontology based on patterns and antipatterns. Some suggestions for 
repairing are also given. In [19] we provided a method to detect and repair 
wrong and missing is-a structure in taxonomies connected by mappings. 

A different setting is the area of modular ontologies where the ontologies 
are connected by directional mappings and where knowledge propagation only 
occurs in one direction. Regarding the detection of semantic defects, within a 
framework based on distributed description logics, it is possible to restrict the 
propagation of local inconsistency to the whole set of ontologies (e.g. [30]). 

Abductive reasoning in description logics In ^ four different abductive 
reasoning tasks are defined - (conditionalized) concept abduction, ABox abduc- 
tion, TBox abduction and knowledge base abduction. Concept abduction deals 
with finding sub-concepts. Abox abduction deals with retrieving abductively in- 
stances of concepts or roles that, when added to the knowledge base, allow the 
entailment of a desired ABox assertion. Knowledge base abduction includes both 
ABox and TBox abduction. 

Most existing approaches for DL abduction focus on ABox and concept ab- 
duction and are mostly based on existing proof techniques such as semantic 
tableaux and resolution. Since the number of possible solutions is very large, 
the approaches introduce constraints. The work in [T7] proposes a goal-oriented 
approach where only actions which contribute to the solution are chosen in the 
proof procedures. The method is both complete and sound for consistent and 
semantically minimal solutions. Since the set of solutions can contain some in- 
consistent and non-minimal solutions additional checks are required. A practical 
approach for ABox abduction, based on abductive logic programming, was pro- 
posed in [7]. In order to use existing abductive logic programming systems it 
is necessary to do a transformation to a plain Datalog program. The solutions 
are consistent and minimal given a set of abduciles. However, the approach does 
not guarantee completeness since the translation to a Datalog program is ap- 
proximate and in some cases a solution would not be found. An approach for 
conditionalised concept abduction that uses a variation of the semantic tableaux 
and two labeling functions was proposed in [J- The two labeling functions T() 
and F() represent true and false formulas in a tableaux where the solutions are 
formed from concepts which have at least one constraint in F() of every open 
branch. This choice is non-deterministic and can be used to select solutions based 
on some criteria. The algorithm also contains a consistency check which implies 
that produced solutions are always consistent. 

There has not been much work related to TBox abduction, which is the 
most relevant abduction problem for this paper. The work in [T^] proposes an 
automata-based approach to TBox abduction using abduciles representing ax- 
ioms that can appear in solutions. It is based on a reduction to the axiom pin- 
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pointing problem which is then solved with automata-based methods. A PTIME 
algorithm is proposed for the language £L. 

All of the presented approaches to description logic abduction work with 
relatively inexpressive ontologies, such as ZL and ALC. However, some recent 
work [B] describes a type of conditionalised concept abduction called structural 
abduction which is applicable to S'H. 

7 Conclusion 

This paper formalized repairing the is-a structure in AdC acyclic terminologies 
as a generalized TBox abduction problem, provided a solution based on semantic 
tableaux, and discussed a system. 

There are a number of interesting aspects for future work. First, we intend 
to extend the algorithm to deal with more expressive ontologies. Further, it may 
be useful to consider also solutions introducing incoherence as they may lead to 
the detection of other kinds of modeling defects such as wrong is-a relations. In 
this case we do not assume anymore that the existing structure is correct. As a 
domain expert may need to deal with many possible solutions, other useful ex- 
tensions are mechanisms for ranking the generated repairing actions and among 
those recommending repairing actions, e.g. based on domain knowledge. 
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Fig. 13. Completion graph for MyPizza n -^FishyMeatyPizza. 



Repairing missing is-a relations in ALC ontologies 



25 



ABox 1.1: CLOSED 

ABox 1.2.1: CLOSED 

ABox 1.2.2.1: CLOSED 

ABox 1.2.2.2: 

PoSa; — {MyPizza, Pizza}, 
Nega; = {FishyMeatyPizza} 

PoSy = {AnchoviesTopping, AnchoviesTopping , PizzaTopping, Meat Topping, MeatTopping}, 

Ncgy ^ {FishTopping} 

Poss — {ParmaHamTopping, ParmaHamTopping, PizzaTopping, Me at Topping, MeatTopping}, 
Neg^ ^ {FishTopping} 

i?_4 — {MyPizza C FishyMeatyPizza, Pizza □ FishyMeatyPizza, AnchoviesTopping C FishTopping, 

PizzaTopping t FishTopping, ParmaHamTopping C FishTopping, MeatTopping C FishTopping} 
ABox 1.2.2.3: 

Pos^ — {MyPizza, Pizza}, 
Ncg.i: — {FishyMeatyPizza} 

PoSy — {AnchoviesTopping, AnchoviesTopping . PizzaTopping, MeatTopping, MeatTopping}, 
Ncgy — {FishTopping} 

PoSs — {ParmaHamTopping, ParmaH arriTopping . PizzaTopping}, 
Ncg2 — {FishTopping, FishTopping} 

Rj\^ — {MyPizza C FishyMeatyPizza, Pizza □ FishyMeatyPizza, AnchoviesTopping C FishTopping, 

PizzaTopping t FishTopping, ParmaHamTopping □ FishTopping, MeatTopping □ FishTopping} 
ABox 1.2.3.1: CLOSED 
ABox 1.2.3.2: 
Posa; = {MyPizza, Pizza}, 
Nega; = {FishyMeatyPizza} 

PoSy = {AnchoviesTopping, AnchoviesTopping , PizzaTopping}, 
Ncgy — {FishTopping, FishTopping} 

PoSz — {ParmaHamTopping, ParmaHamTopping, PizzaTopping, MeatTopping, MeatTopping}, 

Ncg2 — {FishTopping} 

— {MyPizza C Fishy]McaLyF*izza, Pizza C FishyMeatyPizza, AnchoviesTopping C FishTopping, 

PizzaTopping C FishTopping, ParmaHamTopping C FishTopping, MeatTopping C FishTopping} 
ABox 1.2.3.3: 

Pos^ ^ {MyPizza, Pizza}, 
Ncg.j; — {FishyMeatyPizza} 

PoSy — {AnchoviesTopping, AnchoviesTopping , PizzaTopping}, 
Ncgy — {FishTopping, FishTopping} 

PoSs — {ParmaHamTopping, ParmaH amToppirig, PizzaTopping}, 
Ncgs — {FishTopping, FishTopping} 

Rj^ —{MyPizza □ FishyMeatyPizza, Pizza t FishyMeatyPizza, AnchoviesTopping □ FishTopping, 

PizzaTopping t FishTopping, ParmaHamTopping □ FishTopping} 
ABox 1.3.1: CLOSED 

ABox 1.3.2.1: CLOSED 

ABox 1.3.2.2: 

Posa; = {MyPizza, Pizza}, 
Nega; = {FishyMeatyPizza} 

PoSy = {AnchoviesTopping, AncJior/c'-'I'oppl-nfi. PizzaTopping}, 
Negy = {MeatTopping,MeatTopp/;//(y} 

Posz = {ParmaHamTopping, ParmaHamTopping, PizzaTopping}, 
Negz = {MeatTopping, MeotTopping} 

={MyPizza C FishyMeatyPizza, Pizza C FishyMeatyPizza, AnchoviesTopping C MeatTopping, 
PizzaTopping t MeatTopping, ParmaHamTopping □ MeatTopping} 



Fig. 14. Creating for the Leaf ABoxes related to MyPizza □ FishyMeatyPizza. 
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ABox 1: 

Posx = {MyPizza, Pizza}, Ncg^, =- f FislivMratvPizza} 

PoSy = {AnchoviesTopping, AncJiora:.^I\>f>^>uitj, PizzaTopping}, Negy = 
Posz = {ParmaHamTopping, ParmaHamTopping, PizzaTopping}, Neg^ = 
RJ^ = {MyPizza C FishyMeatyPizza, Pizza C FishyMeatyPizza} 



ABox 1.1: CLOSED 



ABox 1.2: 

Pos^ = 0, Nog^ = 

Posj, = 0, Nog„ = {FisliTopping} 

Posj = 0, Nogj = {FishTopping} 

-R^ —{AnchoviesTopping □ FishTopping, PizzaTopping C FishTopping, ParmaHamTopping C FishTopping} 



ABox 1.2.1: CLOSED 



ABox 1.2.2: 

PoSa: = 0, Nega, = 

PoSy = {MeatTopping, MeatTopping}, Negy = 

Pos^ = 0, Negj- = 

-R^ ={ MeatTopping t FishTopping} 



ABox 1.2.2.1: CLOSED 



ABox 1.2.2.2: 

Pos^ = 0, Ncg^ = 

PoSj, = 0, Negy = 

Pos^ — {MeatTopping, MeatTopping}, Neg^ 
ii^ = 



ABox 1.2.2.3: 

PoS;j = 0, Ncg^ = 

PoSj, = 0, Nog„ = 

P0S2 — 0, Neg2 — {FishTopping} 



ABox 1.2.3: 

Pos^ = 0, Nog,^ = 

PoSj, = 0, Nogjj = {FishTopping} 

Pos, = 0, Nog, = 



ABox 1.2.3.1: CLOSED 



ABox 1.2.3.2: 

Pos^ = 0, Negx = 

Pos^ = 0, Negy = 

Pos, = {MeatTopping, MeatTopping}, Neg, — 
fl^ ={MeatTopping H FishTopping} 



ABox 1.2.3.3: 

Posa, = 0, Negx = 

Pos„ = 0, Negy = 

Pos, = 0, Neg, = {FishTopping} 

Ra = 1> 



ABox 1.3: 

Pos^ = 0, Ncg^ = 

PoSj, = 0, Nog„ = {MeatTopping} 

PoSj = 0, Nog, = {MeatTopping} 

Ra —{AnchoviesTopping □ MeatTopping, PizzaTopping C MeatTopping, ParmaHamTopping C MeatTopping} 



ABox 1.3.1: CLOSED 



ABox 1.3.2: 

Pos^ = 0, Neg^ = 

PoSy = 0, Negy = {MeatTopping} 
Posz = 0, Neg^ = 



ABox 1.3.2.1: CLOSED 



ABox 1.3.2.2 

Pos^ = 0, Neg. 
PoSj, = 0, Ne, 
Pos, = 0, Neg, = {MeatTopping} 



y 



Fig. 15. Creating for the ABoxes related to MyPizza C FishyMeatyPizza - opti- 
mized algorithm 



